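This analysis uses the Google Play Store apps dataset published on Kaggle. Invalid or erroneous records were removed first, and the analysis then focuses on the distribution of app ratings.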
library(ggplot2)
library(plotly)
# Data from https://www.kaggle.com/lava18/google-play-store-apps
mydata <- read.csv("googleplaystore.csv")
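The source does not show the cleaning step it mentions. The following is a minimal sketch of what it might look like, assuming "invalid" means a missing rating, a rating outside the 1 to 5 star range, or an exact duplicate row. The helper name `clean_apps` and the toy data frame are hypothetical, and the author's actual rules may differ.

```r
# Hypothetical helper: the source does not state its actual cleaning rules.
clean_apps <- function(df) {
  # Coerce Rating to numeric in case it was read as text; "NaN" becomes NaN.
  df$Rating <- suppressWarnings(as.numeric(as.character(df$Rating)))
  # Keep rows whose rating is present and within the 1-5 star range.
  ok <- !is.na(df$Rating) & df$Rating >= 1 & df$Rating <= 5
  df <- df[ok, , drop = FALSE]
  # Drop exact duplicate rows.
  df[!duplicated(df), , drop = FALSE]
}

# Toy data standing in for googleplaystore.csv (values are made up).
toy <- data.frame(
  App      = c("A", "B", "B", "C", "D"),
  Category = c("FAMILY", "GAME", "GAME", "TOOLS", "1.9"),
  Rating   = c(4.1, 4.5, 4.5, NaN, 19),
  stringsAsFactors = FALSE
)
cleaned <- clean_apps(toy)
print(cleaned)
```

With the real file, the same helper could be applied as `mydata <- clean_apps(mydata)` before plotting, so that every chart works from the same filtered rows.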
Before the analysis, here is a summary of the data used.
- The dataset covers 9,244 apps in 33 categories. Family is the largest category, followed by Game and then Tools.
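Summary figures such as the app total, the number of categories, and the largest categories can be checked with a frequency table. The sketch below uses a made-up `category` vector so that it runs on its own. With the real data, `mydata$Category` would take its place.

```r
# Toy category column (made up); replace with mydata$Category for the real data.
category <- c("FAMILY", "GAME", "FAMILY", "TOOLS", "GAME", "FAMILY")

# Frequency of each category, largest first.
counts <- sort(table(category), decreasing = TRUE)

n_apps       <- length(category)   # total number of apps
n_categories <- length(counts)     # number of distinct categories
top3         <- names(counts)[1:3] # the three largest categories

print(counts)
print(top3)
```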
plot_ly(mydata, x = ~Category, color = ~Category, type = "histogram")
# Histogram of app ratings in 0.1-star bins; rows with a missing rating are dropped
my.plot3 <- ggplot(mydata, aes(x = Rating)) +
  geom_histogram(binwidth = 0.1, fill = "steelblue", na.rm = TRUE)
my.plot3
plot_ly(mydata, x = ~Rating, color = ~Category, type = "box")
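The box plot compares rating distributions across categories visually. A numeric companion, sketched here on made-up data, is the median rating for each category sorted from highest to lowest. The data frame `toy` is hypothetical, and with the real data `mydata` would be passed instead.

```r
# Toy data (made up) with the same two columns used by the box plot.
toy <- data.frame(
  Category = c("FAMILY", "FAMILY", "FAMILY", "GAME", "GAME", "TOOLS"),
  Rating   = c(4.0, 4.2, 4.6, 4.5, 4.7, 3.9)
)

# Median rating per category; the formula interface drops rows with NA ratings.
med <- aggregate(Rating ~ Category, data = toy, FUN = median)

# Sort categories from highest to lowest median rating.
med <- med[order(med$Rating, decreasing = TRUE), ]
print(med)
```

The median is used rather than the mean because it is the centre line drawn in each box, so the table and the plot describe the same quantity.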